AMENDMENT TO THE CLAIMS 
Please amend the presently pending claims as follows: 

1. (Currently Amended) A method of identifying a caller of a call from the caller to a 
recipient, the method comprising: 

(a) receiving a voice input from the caller; 

(b) applying characteristics of the voice input to a plurality of acoustic models, which 

comprises a generic acoustic model and acoustic models of any previously 
identified callers, to obtain a plurality of respective acoustic scores , wherein the 
generic acoustic model comprises caller independent models of a plurality of 
speech units ; 

(c) identifying the caller as one of the previously identified callers or a new caller based 

on the plurality of acoustic scores; and 

(d) if the caller is identified as a new caller in step (c), generating a new acoustic model 
for the new calle r, which is sp e cific to the n e w call e r from the caller-independent 
models of the generic acoustic model and modifying the caller-independent 
models of the speech units that are included in the voice input to represent the 
characteristics of the voice input received from the new caller such that the new 
acoustic model for the new caller and the acoustic models of any previously 
identified callers comprise the same plurality of speech units . 

The method of claim 1 wherein identifying in (c) comprises: 
identifying the caller as one of the previously identified callers if the acoustic score 
for the respective acoustic model is better than the acoustic score for the generic 
acoustic model; and 

identifying the caller as a new caller if the acoustic score for the generic acoustic 
model is better than the acoustic scores for the acoustic models of the plurality of 
previously identified callers. 



2.(Original) 
(c)(1) 

(c)(2) 



3. (Previously Presented) The method of claim 1 wherein: 

step (a) comprises segmenting the voice input into a sequence of recognized speech units 

using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units segmented 

in step (a); and 

step (b) comprises applying the characteristics of the voice input to a sequence of the 
models of the speech units segmented in step (a) for the plurality of acoustic 
models. 

4. (Currently Amended) The method of claim 1 wh e rein each of th e plurality of acoustic 
mod e ls comprises models of spe e ch units and wherein the method further comprises: 

(e) if the caller is identified as one of the previously identified callers in step (c), updating 
the respective acoustic model for the previously identified caller by modifying the 
models of the speech units that are included in the voice input, based on the 
characteristics of the voice input. 

5. (Original) The method of claim 4 wherein step (e) comprises modifying the models of the 
speech units that are included in the voice input based on as little as a single utterance. 

6. (Original) The method of claim 1 and further comprising: 

(e) storing the new acoustic model in an acoustic model repository with the plurality of 
acoustic models such that the new acoustic model becomes one of the plurality of 
acoustic models in step (b) and the new caller is included as a previously 
identified caller. 



7. (Canceled). 



8. (0riginal) The method of claim 1 wherein steps (a) through (c) are performed without 
alerting the caller during the call that the caller is being identified. 

9. (Original) The method of claim 1 wherein: 

step (b) comprises splitting the voice input into subsections and applying the 
characteristics of each subsection to the plurality of acoustic models to obtain a 
plurality of respective acoustic scores that represent how well the characteristics 
in each subsection match the respective acoustic models; and 

step (c) comprises, for each subsection, identifying the acoustic model having the best 
acoustic score for that subsection and identifying the caller as one of the 
previously identified callers only if the best acoustic scores for all subsections 
correspond to the same previously identified caller. 

10. (Original) The method of claim 1 and further comprising: 

(e) maintaining a caller-specific language model for each of the previously identified 

callers based on the voice inputs from those callers; 

(f) applying the characteristics of the voice input to the generic acoustic model and each 

of the caller-specific language models to produce a plurality of recognized speech 
unit sequences; 

(g) choosing the recognized speech unit sequence that has a highest probability relative 

to probabilities of the other recognized speech unit sequences; and 

(h) identifying the caller based at least in part on the recognized speech unit sequence 

having the highest probability. 

1 1. (Currently Amended) The method of claim 10 and further comprising: 

(i) if the caller identified in step (h) is different than the caller identified in step (c), 

generating a user prompt for a manual r e vi e w of at least one of the following: the 
voice input, the recognized speech unit sequence, the identified callers, the 



-7- 



acoustic model of the caller identified in step (c), and of the caller-specific 
language model of the caller identified in step (h). 

12. (Original) The method of claim 1 and further comprising: 

(e) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag certain acoustic models for merging together. 

13. (Currently Amended) The method of claim 12 wherein step (e) comprises flagging 
generating a user prompt, which identifies the certain acoustic models for manual insp e ction . 

14. (Currently Amended) A system for identifying a caller of a call from the caller to a 
recipient, the system comprising: 

a receiver for receiving a voice input from the caller; 

an acoustic model repository comprising a plurality of acoustic models, including a 
generic acoustic model and acoustic models of any previously identified callers^ 
wherein the generic acoustic model comprises caller-independent models of a 
plurality of speech units ; 

means for applying characteristics of the voice input to the plurality of acoustic models to 
produce a plurality of respective acoustic scores; 

means for identifying the caller as one of the previously identified callers or a new caller 
based on the plurality of acoustic scores; and 

acoustic model generator means for generating a new acoustic model for the new caller if 
the acoustic score for the generic acoustic model is better than the acoustic scores 
for the acoustic models of the plurality of previously identified callers , wherein 
the acoustic model generator means generates the new acoustic model from the 
caller-independent models of the generic acoustic model and modifies the caller- 
independent models of the speech units that are included in the voice input to 
represent the characteristics of the voice input received from the new caller such 
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that the new acoustic model for the new caller and the acoustic models of any 
previously identified callers comprise the same plurality of speech units . 

15. (Original) The system of claim 14 and further wherein: 

the system further comprises a speech recognizer, which segments the voice input into a 

sequence of recognized speech units using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units recognized 

by the speech recognizer; and 
the means for applying comprises means for applying the characteristics of the voice 

input to a sequence of the models of the speech units segmented by the speech 

recognizer for the plurality of acoustic models. 

16. (Currently Amended) The system of claim 14 wherein: 

e ach of th e plurality of acoustic mod e ls compris e s mod e ls of sp e ech units; and 
the system further comprises an acoustic model updating module, which if the caller is 
identified as one of the previously identified callers, updates the respective 
acoustic model for the previously identified caller by modifying the models of the 
speech units that are included in the voice input, based on the characteristics of 
the voice input. 

17. (Original) The system of claim 16 wherein the acoustic model updating module is capable of 
modifying the models of the speech units that are included in the voice input based on as little as 
a single utterance from the caller. 

18. (Original) The system of claim 14 wherein the acoustic model generator means stores the 
new acoustic model in the acoustic model repository such that the new acoustic model becomes 
one of the plurality of acoustic models and the new caller is included as a previously identified 
caller. 



19. (Canceled). 



20.(Original) The system of claim 14 wherein system is configured to receive the voice input 
and identify the caller without alerting the caller during the call that the caller is being identified. 

21 .(Original) The system of claim 14 wherein: 

the means for applying comprises means for splitting the voice input into subsections and 
applying the characteristics of each subsection to the plurality of acoustic models 
to obtain a plurality of respective acoustic scores that represent how well the 
characteristics in each subsection match the respective acoustic models; and 

the means for identifying comprises, for each subsection, means for identifying the 
acoustic model having the best acoustic score for that subsection and means for 
identifying the caller as one of the previously identified callers only if the best 
acoustic scores for all subsections correspond to the same previously identified 
caller. 

22. (Original) The system of claim 14 and further comprising: 

a language model repository for storing a caller-specific language model for each of the 

previously identified callers based on the voice inputs from those callers; 
means for applying the characteristics of the voice input to the generic acoustic model and 

each of the caller-specific language models to produce a plurality of recognized 

speech unit sequences; and 
means for choosing the recognized speech unit sequence that has a highest probability 

relative to probabilities of the other recognized speech unit sequences, wherein the 

means for identifying identifies the caller based at least in part on the recognized 

speech unit sequence having the highest probability. 



-10- 



23. (Currently Amended) The system of claim 22 wherein the means for identifying comprises 
means for generating a user prompt for a manual r e vi e w of at least one of the following: (1) the 
voice input, the recognized speech unit sequence having the highest probability, (2) the caller- 
specific language model producing the recognized speech unit sequence having the highest 
probability, and (3) the acoustic model having the best acoustic score, if the caller-specific 
language model having the highest probability corresponds to a different caller than the acoustic 
model having the best acoustic score in (3). 

24. (Original) The system of claim 14 and further comprising: 

means for flagging certain acoustic models for merging together based on a distance 
measure between the plurality of acoustic models. 

25. (Currently Amended) The system of claim 24 wherein the means for flagging comprises 
means for flagging generating a user prompt, which identifies the certain acoustic models fef 
manual insp e ction . 

26. (Currently Amended) A computer-readable medium comprising computer-executable 
instructions that, when executed by a computer, performs the method comprising: 

(a) receiving a voice input of a call from a caller; 

(b) applying characteristics of the voice input to a plurality of acoustic models, which 

comprises a generic acoustic model and acoustic models of any previously 
identified callers, to obtain a plurality of respective acoustic scores that represent 
how well the characteristics match the respective acoustic models , wherein the 
generic acoustic model comprises caller-independent models of a plurality of 
speech units ; and 

(c) identifying the caller as one of the previously identified callers or a new caller based 

on the plurality of acoustic scores; and 

(d) if the caller is identified as a new caller in step (c), generating a new acoustic model 
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for the new calle r, which is sp e cific to th e n e w call e r from the caller-independent 
models of the generic acoustic model and modifies the caller-independent models 
of the speech units that are included in the voice input to represent the 
characteristics of the voice input received from the new caller such that the new 
acoustic model for the new caller and the acoustic models of any previously 
identified callers comprise the same plurality of speech units . 

27. (Original) The computer-readable medium of claim 26 wherein: 

step (a) comprises segmenting the voice input into a sequence of recognized speech units 

using the generic acoustic model; 
each of the plurality of acoustic models comprises models of the speech units segmented 

in step (a); and 

step (b) comprises applying the characteristics of the voice input to a sequence of the 
models of the speech units segmented in step (a) for the plurality of acoustic 
models. 

28. (Currently Amended) The computer-readable medium of claim 26 wh e r e in e ach of th e 
plurality of acoustic models compris e s mod e l s of sp ee ch units and wherein the method further 
comprises: 

(e) if the caller is identified as one of the previously identified callers in step (c), updating 
the respective acoustic model for the previously identified caller by modifying the 
models of the speech units that are included in the voice input, based on the 
characteristics of the voice input. 

29. (Original) The computer-readable medium of claim 26 wherein the method further 
comprises: 

(e) storing the new acoustic model in an acoustic model repository with the plurality of 
acoustic models such that the new acoustic model becomes one of the plurality of 



-12- 



acoustic models in step (b) and the new caller is included as a previously 
identified caller. 

30. (Canceled). 

31. (Original) The computer-readable medium of claim 26 wherein the method further 
comprises: 

(e) maintaining a caller-specific language model for each of the previously identified 

callers; and 

(f) identifying the caller based at least in part on probabilities of recognized speech unit 

sequences produced by the caller-specific language models from the voice input. 

32. (Currently Amended) The computer-readable medium of claim 31 wherein the method further 
comprises: 

(g) if the caller identified in step (f) is different than the caller identified in step (c), 

generating a user prompt for a manual revi e w of at least one of the following: the 
voice input, the recognized speech unit sequence, the identified callers, the 
acoustic model of the caller identified in step (c), and of the caller-specific 
language model of the caller identified in step (f). 

33. (Original) The computer-readable medium of claim 26 wherein the method further 
comprises: 

(e) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag certain acoustic models for merging together. 



34. (Currently Amended) The computer-readable medium of claim 33 wherein step (e) 
comprises flagging generating a user prompt, which identifies the certain acoustic models for 
manual insp e ction . 
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35.(Currently Amended) A computer-implemented method of identifying a caller of a call 
from the caller to a recipient, the method comprising: 

(a) receiving a voice input; 

(b) segmenting the voice input into a sequence of recognized speech units using a caller- 

independent, generic acoustic model , wherein the generic acoustic model 
comprises caller-independent models of a plurality of speech units ; 

(c) applying characteristics of the voice input to a sequence of speech unit models of the 

recognized speech units within a plurality of acoustic models, which comprises 
the generic acoustic model and acoustic models of any previously identified 
callers; and 

(d) identifying the caller as one of a plurality of previously identified callers or as a new 

caller based on how well the characteristics of the voice input fit the plurality of 
acoustic models , and if the caller is identified as a new caller, generating a new 
acoustic model for the new caller from the generic acoustic model by modifying 
the speech unit models of the speech units that are included in the voice input to 
represent the characteristics of the voice input received from the new caller such 
that the new acoustic model for the new caller and the acoustic models of any 
previously identified callers comprise the same plurality of speech units . 



36.(Currently Amended) The method of claim 35 and further comprising: 

(e) if th e call e r is id e ntifi e d as a n e w caller in st e p (d), g e n e rating a n e w acoustic mod e l 

for th e n e w caller from th e gen e ric acoustic mod e l by modifying th e sp ee ch unit 
mod e ls of th e sp ee ch units that ar e includ e d in th e voice input to r e pr e s e nt th e 
charact e ristics of th e voic e input r e c e iv e d from the n e w call e r; and 

(f) storing the new acoustic model in an acoustic model repository with the plurality of 

acoustic models such that the new acoustic model becomes one of the plurality of 
acoustic models in step (c) and the new caller is included as a previously 
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identified caller. 

37.(Currently Amended) The method of claim 35 and further comprising: 

(e) maintaining a caller-specific language model for each of the previously identified 

callers based on the voice inputs from those callers; 

(f) applying the characteristics of the voice input to the generic acoustic model and each 

of the caller-specific language models to produce a plurality of recognized speech 
unit sequences; 

(g) choosing the recognized speech unit sequence that has a highest probability relative 

to probabilities of the other recognized speech unit sequences; 

(h) identifying the caller based on the recognized speech unit sequence having the 

highest probability; and 

(i) if the caller identified in step (h) is different than the caller identified in step (d), 

generating a user prompt for a manual r e vi e w of at least one of the following: the 
voice input, the recognized speech unit sequence, the identified callers, the 
acoustic model of the caller identified in step (d), and of the caller-specific 
language model of the caller identified in step (h). 



38. (Previously Presented) The method of claim 35 wherein the method further comprises: 

(e) using a distance measure between the plurality of acoustic models of the previously 
identified callers to flag certain acoustic models for merging together. 



